PATENT APPLICATION 



60194338 



Date 
Entered 

or 
Counted 



APPROdfelFOR LICENSE Q 



JC553 U.S. P.TO 



04/O3/OD 



CONTENTS 



>7 



Date 
Received 

or 
Mailed 



r- i.:t'i.y.';v-j- v ' ; - " ■ 




\.Appljcath 
2. 
3. 
4. 
5._ 
6 

7._ 

8,. 
9.. 
10.. 

11-. 
12.. 
13.. 
14.. 
15.. 
16.. 
17. 
18. 
19.. 
20.. 
21.. 
22.. 
23. 
24. 
25. 
26. 
27. 
28. 
29. 
30. 
31. 
32. 




^0 



(LEFT OUTSIDE) 



POSITION 


INITIALS 


ID NO 


nATP 

UM 1 C 










FEE DETERMINATION 








O.I.P.E. CLASSIFIER 








FORMALITY REVIEW 


J.-J''^ ' 




(,•>/ // v 


RESPONSE FORMALITY REVIEW 

















(LEFT INSIDE) 



Page 1 of 1 




United States Patent and Trademark Office 



commissioner for patents 
United States Patent and Trademark Office 

WASHINGTON, D.C. SO 231 

www.uspto.gca/ 



Bib Data Sheet 



SERIAL NUMBER 

60/194,338 



FILING DATE 

04/03/2000 
RULE 



CLASS 



GROUP ART UNIT 



ATTORNEY 
DOCKET NO. 



APPLICANTS 

Barry A. Bunin, San Bruno, CA ; 

"** CONTINUING DATA ************************* 

■** FOREIGN APPLICATIONS ******************** 

IF REQUIRED, FOREIGN FILING LICENSE 
GRANTED ** 06/07/2000 



' SMALL ENTITY 



Foreign Priority claimed 

35 USC 119 (a-d) conditions 
met 



Verified and 
Acknowledged 



□ y«Qno 

Allowance 



no W Met after 



Examiner's Signature 



Initials 



STATE OR 


SHEETS 


TOTAL 


INDEPENDENT 


COUNTRY 


DRAWING 


CLAIMS 


CLAIMS 


CA 

















ADDRESS 

Brian I Marcus Esq 
Fliesler Dubb Meyer & Lovejoy LLP 
Four Embarcadero Center Suite 400 
San Francisco ,CA 94111-4156 



TITLE 

System and method for obtaining disease target solutions 



FILING FEE 
RECEIVED 

100 



FEES: Authority has been given in Paper 

Mo. to charge/credit DEPOSIT ACCOUNT 

No. for following: 



□ AH Fees 



D 1.16 Fees (Filing) 



Q 1.17 Fees ( Processing Ext. of 
time ) 



D 1.18 Fees (Issue) 
□ Other 



□ 



Credit 



file://C:\APPS\PreExam\correspondence\l_A.xml 



9/19/00 



PATENT APPLICATION SERIAL NO. 



U.S. DEPARTMENT OF COMMERCE 
PATENT AND TRADEMARK OFFICE 
FEE RECORD SHEET 



04/10/2000 HHAPPY 00000051 60194338 

01 FC:114 75.00 OP 



PTO-1556 
(5/87) 



'U.S. GPO: 1899-459-082/1B144 



0 l/.^0</~ oO 



C-l 

O 




CO 



1 r4 



IN THE UNITED STATES PATENT AND TRADEMARK OFFICE 



=ft re Application 
Inventor: 
SC/Serial No.: 
Filed: 



Barry A. Bunin 

Unknown 

Herewith 



Title: SYSTEM AND METHOD FOR OBTAINING 
DISEASE TARGET SOLUTIONS 



PATENT APPLICATION 



w3 

m — 
incza 
invo 

O 



CERTIFICATE OF MAILING BY "EXPRESS MAIL" 
UNDER 37 C.F.R.§1.10 

"Express Mail" mailing label number: EL504217576US 
Date of Mailing: April 3, 2000 

I hereby certify that this correspondence is being deposited with the 
United States Postal Service, utilizing the "Express Mail Post Office to Addressee" 
service addressed to Box Provisional Patent Application, Assistant 
Commissioner for Patents, Washington, DC 20231 and mailed on the above 
Date of Mailing with the above "Exp^sJVlaiT mailing label number. 

— (Signatured 

Johann S. Mercado ^ 

Signature Date: April 3. 2000 




PROVISIONAL APPLICAT ION FOR PATENT TRANSMITTAL LETTER 

UNDER 37 C.F.R. S1.53(c1 



Box Provisional Patent Application 
Assistant Commissioner for Patents 
Washington, DC 20231 

Sir: 

ThisisarequestforfilingaPROVISIONALAPPLICATIONFORPATENTunder37CFR 
§ 1.53(c). 



Attorney Docket No.: BUNI-01000USO 
/bim/buni.001 



-1- 



Express Mail No. EL504217576US 
4/3/0-12:38 



INVENTOR(s)/APPLICANT(s): 

NAME RESIDENCE (City and Either State or Foreign Country) 

Bunin. Barry A. San Bruno, California 

Last, First'M.I. 

TITLE OF THE INVENTION (280 characters max): 

SYSTEM AND METHOD FOR OBTAINING DISEASE TARGET SOLUTIONS 



CORRESPONDENCE ADDRESS: 

Please direct all correspondence concerning the above- identified application to the following 
address: 

Brian I. Marcus, Esq. 
FLIESLER, DUBB, MEYER & LOVEJOY LLP 
Four Embarcadero Center, Suite 400 
San Francisco, California 941 1 1-4156 
Telephone: (415) 362-3 800 . 

Please direct all telephone calls to the undersigned attorney at (415) 362-3800. 

ENCLOSED APPLICATION PARTS (check all that apply): 

X Specification Number of pages: _3 Small Entity Statement. 

_ Drawings Number of sheets: _ _ Other (specify) 



METHOD OF PAYMENT: 

X_ A check in the amount of $ 75.00 to cover the filing fee ($75.00 for Small Entity/ $1 50.00 
for other than a small entity) is enclosed. 

_ The Commissioner is hereby authorized to charge underpayment of any additional fees or 
credit any overpayment associated with this communication to Deposit Account No . 06-1325 . 
A duplicate copy of this authorization is enclosed. 



Attorney Docket No.: BUNI-0I00OUSO 
/bim/buni.001 



-2- 



Express Mail No. EL504217576US 
4/3/0-14:9 



The invention was made by an agency of the United States Government or under a contract 
with an agency of the United States Government. 

X No. 

_ Yes, the name of the U.S. Government agency and Government contract number are: 



Respectfully submitted, 



Date: April 3. 2000 




FLIESLER, DUBB, MEYER & LOVEJOY LLP 
Four Embarcadero Center, Suite 400 
San Francisco, California 9411 1-4156 
Telephone: (415) 362-3800 



Attorney Docket No.: BUNI-01000USO 
/bim/buni.001 



-3- 



Express Mail No. EL5042I7576US 
4/3/0-12:38 



Express Mail Label No.: EL504217576US 



UNITED STATES PROVISIONAL PATENT APPLICATION 

FOR 

SYSTEM AND METHOD FOR OBTAINING 
DISEASE TARGET SOLUTIONS 

Inventor: 

Barry A. Bunin 



01/30/2000 05:25 510&490825 BUNINS PAGE 02 



'7 

^ ^^^^ 

Z far/ 2*4? ^ afyw 



01/30/2B00 05:25 5106490825 



BUNINS 



PAGE 03 





fid* ^fylooHts) 









Ucr ^^j *"^ J -y ^f^-Aj bo^Jt -frgt^ m.Jet,^ 



I. £y s lieSte use. r^U/h 1*0 



.^S H 'tArrfcJ 0$ S^'W.*^ 

/»«1<7« tM'fru; QsVfUlm* fatn»*b*f rOSS n/'rtmr™ ' 



fi[e:///c:/APPS/preexam/correspondence/4.htm 



FORMALITIES LETTER 

iiiiniiiiiiwniuuniii 

•OC0000000051 64000* 



APPLICATION NUMBER 


FILING/RECEIPT DATE 


FIRST NAMED APPLICANT 


ATTORNEY DOCKET NUMBER 


60/194,338 


04/03/2000 


Barry A. Bunin 





Brian t Marcus Esq 
Fliesler Dubb Meyer & Lovejoy LLP 
Four Embarcadero Center Suite 400 
San Francisco, CA 94111-4156 

Date Mailed: 06/07/2000 



NOTICE TO FILE MISSING PARTS OF PROVISIONAL APPLICATION 

FILED UNDER 37 CFR 1.53(c) 
Filing Date Granted 

An application number and filing date have been accorded to this provisional application. The items indicated below, 
however, are missing. Applicant is given TWO MONTHS from the date of this Notice within which to file all required 
items and pay any fees required below to avoid abandonment. Extensions of time may be obtained by filing a 
petition accompanied by the extension fee under the provisions of 37 CFR 1 .1 36(a). 

• The statutory basic filing fee is insufficient. 

Applicant must submit $ 75 to complete the basic filing fee and/or file a small entity statement claiminq such 
status (37 CFR 1.27). 

• To avoid abandonment, a late filing fee or oath or declaration surcharge as set forth in 37 CFR 1 .1 6(e) of 
$50 for a non-small entity, must be submitted with the missing items identified in this letter 

• The balance due by applicant is $125. 



A copy of this notice MUST be returned with the reply. 



Customer Service Center 

Initial Patent Examination Division (703) 308-1202 

PART 3 - OFFICE COPY 




UNITED STATES DEPARTMENT OF COMMERCE 
Patent and Trademark Office 

Address: COMMISSIONER OF PATENT AND TRADEMARKS 
Washington. D.C. 20231 



1 Of 1 



6/7/00 1:56 PM 



IN THE UNITED STATES PATENT AND TRADEMA RK OFFICE / ^ 




In re Application 

Inventor(s): Barry A. Bunin 

SC/Serial No.: 60/194,338 

Filed: April 3, 2000 

Title: SYSTEM AND METHOD FOR OBTAINING 
DISEASE TARGET SOLUTIONS 




PATENT APPLICATION 



CERTIFICATE OF MAILING UNDER 37 C.F.R. § 1 .8 
I hereby certify that this correspondence is being deposited in the 
United States Postal Service with sufficient postage as first class mail in an 
envelope addressed to Box Missing Parts, Assistant Commissioner for 
Patents, Washington, D.C. 20231, on Auoust 28. 2000. 

fJ - [^c^f-gui^. — /j»y» - (Attorney Signature) 

D. Benjamin Borson, Ph.D., Reg. No. 42,349 
Signature Date: August 28, 2000 



RESPONSE TO NOTICE TO FILE MISSING PARTS 

Box Missing Parts 

Assistant Commissioner for Patents 
Washington, D.C. 20231 

Sir: 

In response to the Notice to File Missing Parts of Application Filing Date Granted, 
dated June 7, 2000, enclosed are the following documents in connection with the above- 
identified application: 

X Copy of Notice to File Missing Parts - Filing Date Granted 
Declaration for. Patent Application 

Assignment and Assignment Recordation Form Cover Sheet 

Power of Attorney 

_X Statement(s) of small entity status 

_X Petition for an Extension of Time pursuant to 37 C.F.R. §1 .136(a) for responding 
to the Notice to File Missing Parts 

Information Disclosure Statement under 37 C.F.R. §1.56 



Attorney Docket No.: LBRA1006 SRM/DBB 
dbb/lbra/1 008/1 008. OOS.wpd 



- 1 - 



08/28/0-10:34 



Preliminary Amendment 



Total Fee 

The Total Fee associated with this communication has been calculated as shown 

below: 



_ Patent application filing fee $ 
X Net fee for extension of time $55.00 
_ Assignment recording fee ($40.00) $ 
X_ Surcharge under 37 C.F.R. §1.1 6(e) for 
late filing of filing fee or declaration: 
_ Large Entity $130.00 $ 
_ Small Entity $ 50.00 $50.00 



TOTAL FEE DUE: $105.00 



Method of Payment of Fees 

X A check in the amount of the TOTAL FEE DUE is enclosed. 

X The Commissioner is hereby authorized to charge underpayment of any additional 
fees associated with this communication or credit any overpayment to Deposit 
Account No. 06-1 325 . A duplicate copy of this authorization is enclosed. 

Respectfully submitted, 



Date: ^ u %mA\ 2g ; ^Q*t> 



FLIESLER, DUBB, MEYER & LOVEJOY LLP 
Four Embarcadero Center, Suite 400 
San Francisco, California 941 1 1-4156 
Telephone (415) 362-3800 



By: /^•/X<^W^y / S^>^- — ; 
D. Benjamin Borson, Ph.D. 
Reg. No. 42,349 



- 2 - 

Attorney Docket No.: LBRA1006 SRM/DBB 

dbb/lbra/1 006/1 006.005. wpd 08/28/0-10:34 



IN THE UNITED STATES PATENT AND TRADEMARK OFFICE 



SC/Serial No.: 



In re Application 



Filed: 



Inventor(s): 



April 3, 2000 



Barry A. Bunin 



60/194,338 




PATENT APPLICATION 



Title: SYSTEM AND METHOD FOR OBTAINING 
DISEASE TARGET SOLUTIONS 



CERTIFICATE OF MAILING UNDER 37 C.F.R. § 1 .8 
I hereby certify that this correspondence is being deposited in the 
United States Postal Service with sufficient postage as first class mail in an 
envelope addressed to Assistant Commissioner for Patents, Washington, 
D.C. 20231, on August 28, 2000 , 



In the Notice to File Missing Parts of Provisional Application dated June 7, 2000 , 
a shortened period for response was set to expire on August 7, 2000 . 

Pursuant to 37 C.F.R, §1.1 36(a), Applicant hereby petitions the Commissioner for 
an extension of time for responding to the Office Action up to and including 
September 7, 2000 . 
000 feiil-!^":'; >' ■^■'■rj'.-.r, ^UV'.'^f- 

- 1 - 

Attorney Docket No.: L8RA1006USO SRM/DBB 

dbb/lbra/1 006/1 006.006. wpd 08/28/0-1 0:34 




^ (Attorney Signature) 



D. Benjamin Borson, Ph.D., Reg. No. 42,349 
Signature Date: August 28. 2000 



PETITION FOR EXTENSION OF TIME UNDER 37 C.F.R. §1.136 



Assistant Commissioner for Patents 
Washington, D.C. 20231 



Sir: 



The amount of the petition fee set by 37 C.F.R. §1.17 is determined as follows: 



Fee (Large Entitv/Small Entity) Extended Month 

X_ $1 10.00/$55.00 First 

$380.00/$ 190.00 Second 

$870.00/$435.00 Third 

$1,360.00/$680.00 Fourth 

$1,850.00/$925.00 Fifth 



TOTAL PETITION FEE $ 55.00 

The TOTAL PETITION FEE is included with the payment of other papers filed 
together with this petition. 

The Commissioner is authorized to charge any underpayment or credit any 
overpayment associated with this communication to Deposit Account No. 06-1325. A 
duplicate copy of this authorization is enclosed. 

The other papers enclosed or associated with this communication include: 

X A Response to Notice to File Missing Parts of Provisional Application dated 
June 7, 2000. 

A reply to the Office action was previously filed on . 

A reply to the Office action is filed herewith. 

A reply will be filed subsequently. 

A continuation application is filed herewith. 

A divisional application is filed herewith. 

A continuation-in-part application is filed herewith. 

A continued prosecution application is filed herewith. 



Attorney Docket No.: LBRA1006US0 SRM/DBB 
dbb/lbra/1 006/1 006.008. wpd 



- 2 - 



08/28/0-10:34 



"Small Entity" status for this application has been established by a 

statement previously filed. 

X "Small Entity" status for this application is being established by a statement 

submitted herewith. 

Other: 
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ABSTRACT 



Chemical software tools employ databases and associated 
systems that store, manipulate, and investigate chemical 
information that is organized by reaction chemistry. Specific 
procedures and methods are associated with specific reac- 
tions. Further, such tools may associate reliability ratings 
with individual reactions to identify robust reactions from 
among groups of related reactions. For example, a particular 
benzyl amine may be given a high reliability rating because 
it is superior to other aromatic primary amines in its ability 
to form amides (the reaction chemistry under consideration). 
Further, the software tools may automatically suggest/gen- 
erate diverse libraries for particular precursors, classes of 
precursors, or reaction chemistries. This is accomplished by 
automatically generating a flexible group of reaction chem- 
istries based on like procedures and methods for a particular 
precursor or class of precursors. Preferably, these software 
tools are designed to allow continuous improvement and 
refinement by feedback from humans and/or artificial intel- 
ligence systems. 
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Lihraria's Broad Patent Application(s) Raw Information 



Patent application(s) tentative title: 

"Tying an Evolving Database of Chemical Information to Flexible Services" 

I. Introduction, Purpose, and Scope: 

A. The Purpose of this patent is to cover (but is not limited to): 

1 . The organization of the information 

2. The search of the information 

3 . The evolution of the information 

4. Tying chemical services to the information 

B . The field of "Combinatorial Chemistry" 

1 . History and relationship to all synthesis 

2 . Cu rrent state of the art 

3 . How tying flexible services to "The Electronic Combinatorial Index" 
format represents a novel, non-obvious product(s) 

C. The field of "Artificial Intelligence" 

1 . Previous applications to chemistry (ironically the Dendral project at 
Stanford applying AI to molecular structural identification based on mass 
spectroscopy data was one of the first expert systems published) 

2. Expert Systems 

a. Creation of databases 

b. Agents to search databases 

c. Recent advances (including web-industry technology, multi-agent 
teams, the human/computer interface, and inductive learning in 
flexible hypothesis space) 

D. The proposal is to build a system that grows information databases and searches 
information databases with a team of scientists and artificial intelligence 
capabilities starting with combinatorial chemistry (the most general and reliable 
synthetic methods) and having feedback loops that help both groups learn. At 
every stage when practical the information system will be tied to real services 
(for example, the synthesis of libraries and individual compounds). 

E. Network approaches and open systems for information and services 
R Possibility for multiple patents: 

1 . Patents related to the organization of chemical (and other) information 

2. Patents related to the Evolution of the organization of chemical (and 
other) information 

3 . Patents on the search capabilities of the chemical database 

4. Patents related to the organization of precursors and selection (i.e. 
software, sets" etc.) 

5 . Patents on the methods for tying flexible services to these patents 



II. Building a scaleable architecture for synthetic and related databases 

A. Current efforts with static databases 

B . Our potential architecture described in gory detail (while acknowledging the 
final product(s) will change, imagine each inner circle/database is a subset of the 
next larger circle graphically like a bullseye for darts): 

1 . Central position is "The Electronic Combinatorial Index" 

2. Second circle includes widely used chemistry texts and reference works 

3 . Third circle includes leading references from the second circle 

4. Fourth circle includes leading journals 

5 . Fifth circle includes chemical databases that are commercially available 

C. Create a simple, uniform format with actual procedures and reliability ratings 
that are simple enough for most humans to easily use yet robust enough for an 
artificial intelligence computer program to work with and evolve additional 
tasks, information, and networks 

D. Detailed description of a system that can evolve to include and work with: 

1 . Related fields such as structural biology and drug discovery 

2 . Unrelated fields such as economics and psychology database 

E. Examples of evolved utility with open systems 

1 . Sciquest and Chemdex offerings 

2. Web search engines 

3 . Prous science 

4. Dendral and meta-Dendral 

5. Aldrich chemical catalog 

6 . Cambridge software* s web portal 

III. Detailed description for tying chemical services to the information databases 

IV. Network approach with open systems to develop: 

A. New applicable chemistries 

B . . New AI applicable technologies 

C. Unique groups of experts to interact with the evolving database of information 
and to suggest improvements/additions after each use (perhaps for a discounted 
price or similar benefit, etc.)- 



I. Introduction, Purpose, and Scope: 

A. The Purpose of this patent is to cover (but is not limited to): 

The field of synthesis has evolved from observations and experiments of scientists for 
over a century. The information has been documented in the primary literature as well as a 
number of reference texts on synthetic methods. More recently combinatorial chemistry has 
rapidly developed as a subset of all synthetic chemistry particularly in the last decade. 
Combinatorial chemistry can be though of as the most general and Teliable methods for 
high-throughput synthesis. It often involves automation and has been likened to the 
industrial revolution finally meeting synthetic chemistry. The technology has increased the 
ability of a chemist to rapidly elucidate structure-activity relationships but has often been 
under exploited for a number of reasons. Primary roadblocks include the lack of sufficient 
information on reliable methods for library generation and the difficulties associated with 
optimizing reaction conditions to provide the desired products in high yield. These patent(s) 
describe how one can create, organize, search, evolve and actually use databases of synthetic 
information in conjunction with combinatorial chemistry services. 

B . Summary of Combinatorial Chemistry: 

Note: The following pages as background material are from the early chapters of my 
widely used text on combinatorial chemistry "The Combinatorial Index". 

"The goal of this book is to provide practical information about the combinatorial synthesis of 
small molecules for a range of individuals interested in the field, including bench chemists, lab managers, 
medicinal chemists, agrochemists, academic chemists, computational chemists, and even biologists. 
Regardless of one's experience in combinatorial chemistry, many questions inevitably arise, often simple 
questions such as: What has already been done? How was it done? Where can the necessary materials or 
instruments be found? With an explosion in information, the organization of the information becomes 
crucial. The Combinatorial Index provides a compilation of the synthetic information for constructing 
combinatorial libraries of small molecules. 

The synthesis and screening of libraries of nonbiological oligomers (see Appendix 1) and 
nonpolymeric organic compounds ("small molecules") have rapidly become the focus of intensive research 
efforts. The Combinatorial Index describes the methods reported in the literature for the rapid preparation 
of functionally diverse small molecules. Points of interest, representative examples, and literature 
procedures summarize each method. In addition to focusing on small molecules, The Combinatorial 
Index is a compilation derived exclusively from journal articles (i.e., patent literature is not included). 0 
Comments on the original studies are made; however, the majority of the information presented is factual 
and is thus reported as described in the primary literature. In cases where the copywritten material could 
not be obtained, the procedures were paraphrased as literature summaries. References are provided at the 
beginning of each new heading. Although this compilation provides an overview of the combinatorial 
synthesis of small molecules, it is primarily a resource book. 

Many combinatorial libraries are currently constructed using solid-phase synthesis. Therefore, 
following a brief introduction and background, three chapters are devoted to the different linkers, 
reactions, and analytical techniques for solid-phase synthesis. Although many reactions on support are 
high yielding, significant optimization is often required before they are efficient and general enough to be 
used to construct combinatorial libraries. It is hoped that providing examples of different reactions along 
with the specific conditions that were necessary for optimization will assist related combinatorial studies. 
Representative examples are given to help assess the generality and limitations of the different methods. 
Reliable methods for reactions such as substitutions, cyclizations, condensations, and Suzuki couplings are 
included specifically because they can be used in different contexts. 

Doing organic chemistry on solid support has been likened to working with a blindfold on 
because of the limited analytical and purification techniques available relative to those available in 
solution. This is one of the arguments in favor of building libraries in solution. The most direct way to 



evaluate the fate of a particular set of reaction conditions on support is to cleave the material off of support 
and rigorously characterize the products. Unfortunately, this is not always possible because intermediates 
are often unstable to cleavage conditions. Furthermore, particularly in a multistep sequence, there are 
often faster methods for determining whether a particular reaction worked (i.e., Fmoc quantitation). 
Detailed Procedure for a range of different quantitative and qualitative analytical methods for solid-phase 
synthesis are described in chapter 5. 

A growing number of reports on solution libraries have appeared in the literature. Criteria and 
methods for building solution libraries are discussed in Chapter 6. Whether a library is prepared on 
support or in solution is often dictated by the type of chemistry being developed (or vice versa, the type of 
chemistry being developed is often dictated by whether a library is prepared on support or in solution). 
High-throughput purification techniques such as solid-phase and liquid-phase extractions are often critical 
for preparing solution libraries that are useful for screening. The challenges associated with the 
construction of solution libraries (e.g., solubility and purification) can be quite different from the 
challenges associated with solid-phase combinatorial synthesis (e.g., monitoring reaction progress and 
scale up). Although there are many differences between the solution-phase and solid-phase strategies for 
generating libraries, in both cases the synthetic challenge is to develop reaction conditions that are general 
and high yielding. 

Often I have chosen to be inclusive, rather than selective, in citing examples from the literature. 
Even so, due to the large body of literature, many examples are cited in related studies. For example, in 
the sections discussing subjects such as the formation of amides and esters, only selected examples are 
provided for obvious reasons. I have made every effort to be both fair and informative about the strengths 
and weaknesses of the various methods in this compilation. I apologize for any topics that were 
underrepresented or misrepresented. I would appreciate being notified at bunin@combinatorial.com of 
new or incomplete information for incorporation into future editions or online supplements. 

A salient feature of combinatorial synthesis is that a large amount of diversity can be generated 
from a relatively small number of building blocks. A representative example of a simple combinatorial 
library prepared on solid support from three sets of building blocks, A, B, and C, is illustrated below. 
From only 10 derivatives of each 



building block, a library of 1000 trimers can be generated; with 100 derivatives of each building block, 
1,000,000 compounds can be accessed. With rapid access to such large numbers of compounds, new 
issues arise such as which compounds are the most useful to make and how to keep track of the large 
amount of information that is generated. 

Currently, there are a number of distinct approaches for generating combinatorial libraries in 
vitro. The compounds can be synthesized in a spatially separate format or as pooled mixtures. A number 
of methods for identifying active compounds in a mixture have been developed. Obviously, identifying an 
active compound is straightforward when the compounds are synthesized in a spatially separate format. A 
brief overview of the different methods for preparing synthetic combinatorial libraries follows. More 
detailed discussions can be found in a number of review articles. 1, 2 * 3 * 4 
Methods for Generating Combinatorial Libraries. 

A. Spatially Separate Synthesis. 

The most straightforward approach for library analysis is to keep the different compounds (or 
other variables) spatially separate in a parallel array. The primary advantage of keeping the compounds 
spatially separate is that it removes some of the ambiguities associated with pooling compounds. When 
the compounds are spatially separate, direct structure-activity relationships are obtained from biological 




X(A) + Y(B) + Z(C) 



X + Y + Z = Total number of variables used as inputs in the library 



(X)(Y)(Z) = Total number of compounds generated from the library synthes 



evaluation. Analytical evaluation of the chemical integrity of the compounds is also straightforward when 
the compounds are spatially separate. The primary disadvantage of spatially separate libraries is that the 
number of compounds that can be synthesized is more limited. 

The first combinatorial library was prepared in a spatially separate format by Geysen and co- 
workers in 1984. 5 They developed functionalized pins for solid-phase peptide synthesis and epitope 
analysis. The pins were configured to be compatible with 96-well microti ter plates. The pin technology 
has been improved using different polymers, as well as higher loading levels and functional linkers to 
accommodate other chemical applications 6 Fodor and co-workers at Affymax have developed 
photolithographic methods for building large libraries on a silicon wafer. 7 Large spatially separate 
libraries (100,000 compounds) can be prepared with this method. However, because it requires 
photolabile protecting groups and support-bound biological assays, the technology is primarily being 
applied to DNA diagnostic tests. 8 A number of new technologies for the preparation of spatially separate 
libraries on resin and in solution are currently being developed. 

B. Split Synthesis. There are a number of different pooling strategies. The earliest of these, 
developed independently by Furka, 9 Lam, 10 and Houghten, 11 employ a split and mix procedure to generate 
mixtures of peptides. In a split synthesis, a quantity of resin is split into equal-sized portions in separate 
reaction vessels and reacted with different monomers. After the reactions are complete, the resin is pooled 
together and thoroughly mixed. A common protecting group can be removed, or a common 
transformation can be performed, in a single reaction vessel. For the coupling of a second monomer, the 
resin is split again, and the process is repeated until the end of the combinatorial synthesis. To couple 
different building blocks, such as activated amino acids, the resin must be split into separate reaction 
vessels to allow reactions with different rates to be driven to completion. 12 

There are a number of techniques for identifying biologically active components from a library 
prepared by a split synthesis. The active components in a mixture can be isolated by deconvolution 
studies such as an iterative resynthesis and evaluation of smaller pools. A portion of the resin can be 
saved at each step to facilitate the iterative resynthesis. In addition, orthogonal, 13 positional, 14 and 
indexed 15 libraries all use pooling strategies that minimize the amount of deconvolution required. 

The combinatorial methods initially developed for peptide synthesis have also been applied to the 
combinatorial synthesis of unnatural biopolymers and small molecules. In one early example, high- 
affinity ligands to 7-transmembrane G-protein-coupled receptors (7TM/GPCR) were identified from the 
split synthesis of a diverse peptoid library. 16 

At the end of any split synthesis, each individual bead theoretically contains a single product, 
since all of the sites on any particular bead have been exposed to the same synthetic reagents. "One- 
compound, one-bead" approaches have been developed to identify the active components in a biological 
assay without resorting to a time-consuming iterative resynthesis. With certain assays of support-bound 
compounds, an active compound from a single resin bead is identified after it binds with a radiolabeled or 
fluorescent-labeled receptor. 17 After active components on support are detected and isolated, the chemical 
structure can be determined using a method such as Ed man degradation for the identification of support- 
bound peptides. Methods for the partial release of compounds off the support have been developed for 
biological evaluation in solution. After biological evaluation, the compound that remains on the resin 
beads can be used for structural, identification. 18 

A conceptually different approach to deconvoluting active components from a library prepared by 
split synthesis involves a molecular tagging scheme. In this approach, readable tags that encode the 
reaction sequence are attached to resin. DNA was an obvious choice for encoding, 19 since that is what 
Nature uses. Unfortunately, DNA is not chemically stable under many of the reaction conditions 
frequently used in organic synthesis. To circumvent this problem, encoding has been performed with 
peptides prepared from amino acids that have relatively unreactive side chains 17 or GC--EC tags that are 
inert to most of the reaction conditions typically employed. 20 The advantages of the GC-EC tags, 
developed by Still and co-workers, are that they can be both detected at less than 0.1 pmol and attached 
directly to polystyrene via carbene chemistry. Thus, the method does not require an orthogonal protecting 
strategy. Radiofrequency tagging strategies have also been developed as an alternative method for 
encoding libraries on resin. 21,22 Alternative approaches to generating combinatorial libraries and 
optimizing biological activity, such as genetic algorithms, are currently being investigated. 23,24 



At least as important as the format in which libraries are prepared are the classes of 
compounds that are accessible. This compilation describes synthetic methods and analytical 
techniques to assist in the development of chemistry for combinatorial libraries." (from 
Bunin, B. A. The Combinatorial Index. Ch. 1-2, 1998, Academic Press). 

By expanding the book to a suite of software products and services their is an 
opportunity to create rapidly accelerate drug discovery in a way that currently non-obvious 
to the ordinary chemist. A key component to this strategy will be the ability to expand the 
initial database on combinatorial chemistry to incorporate all synthetic chemistry. Another 
key component will be to incorporate flexible services as part of the software package. The 
way in which the database will evolve is the final critical component. This will tap heavily 
into related emerging fields of artificial intelligence. 

C. The field of "Artificial Intelligence" 

The field of Artificial Intelligence roots can be traced to the now famous Turing test 
for computer intelligence 25 The basic postulate is that rather than ask if computers can 
think, the more testable question is given a series of questions can an interrogator determine 
if the typewritten answers are coming from a human or a computer. A wealth early studies 
in the field can be found in the classic book Computers and Thoughts for more detailed 
background. 26 

Ironically, one of the first applications (expert systems) to use AI was the Dendral 
project that assisted with molecular structure identification based on mass spectroscopy 
data. 27, 28, 29 In the Dendral program was a collaboration initiated between Feigenbaum, 
Lederberg, Buchanan, and Djerassi to elucidate chemical structure at a high level of 
competence. Given a molecular formula, the spectrographs data, and encoded heuristic 
knowledge of organic chemists, the Dendral interactive program explores possible molecular 
configurations in the search for the true structure. The project helped elucidate some of the 
basic mechanisms of hypothesis generation and evaluation. The results of the project 
suggested that knowledge was as important as reasoning in these systems. In any case, 
today there are many examples where artificial intelligence has been used to generate expert 
systems with varying degrees of success. 

Expert systems attempt to replicate the decision making process of a human expert in 
a limited field. It consists of three components: a knowledge base, decision rules, and an 
inference engine. 30 As with all developing technologies, without an appropriate problem 
they are academic endeavors. The real utility involves how a technology (in this case 
evolving technologies of artificial intelligence and combinatorial chemistry) is applied to a 
specific problem. We have a unique set of interrelated problems that can be solved with 
these technologies leading to efficiencies to society, initially in the field of drug discovery, 
that are not currently obvious. 

There have been a number of current advances that allow us to create products not 
previously available. In particular, because of the nature of the problem we can time the 
development of future products with the development of both growing fields. A few 
examples of current advances relevant to the problems include but are not limited to web- 
industry technology, 31 multi-agent teams, 32 the human/computer interfaces, 33 and inductive 
learning in flexible hypthesis space. 34 These trends have implications for overcoming the 
challenges to feedback-directed optimization in software database development. Current 
advances in the reasoning component will receive human checks and balances in the 
development of a new types of scientific databases linked to services. This.. . 

D. The Proposal: 

To build a system that grows information databases and searches information 
databases with a team of scientists and artificial intelligence capabilities starting with 



combinatorial chemistry (the most general and reliable synthetic methods) and having 
feedback loops that help both groups learn. To maintain relevance at each step the 
information must be evaluated and corrected by humans and computer programs until it is 
evident which is better suited for which specific tasks. This could be done with crossover of 
information so both teams collaborate and compete. At every stage when practical the 
information system will be tied to real services (for example, the synthesis of libraries and 
individual compounds). 

E. Network approaches and open systems for information and services 

In the long run, the key to expanding the systems will be to make it open and 
compatible with outside parties. This approach is applicable to developing the database, 
developing the search engines, developing the basic technologies (both AI and chemical), 
and interacting with other databases. The epitome of this approach is the attraction of the 
Internet, In our case, there will be third party contributions to content, basic research, and 
software architecture. At a certain point there will be a positive spiral that creates uniquely 
evolving database information products (initially in chemistry). Part of the architecture of 
the suite of products will be a filter of suggestions and additional contributions. 

F. The possibility for multiple patents: 

Although the stated purpose of this document is "to build a system that grows 
information databases and searches information databases with a team of scientists and 
artificial intelligence capabilities starting with combinatorial chemistry (the most general 
and reliable synthetic methods) and having feedback loops that help both groups learn. At 
every stage when practical the information system will be tied to real services (for example, 
the synthesis of libraries and individual compounds)." This does not in any way limit the 
potential claims and future applications. The information contained in this document set 
the foundation for multiple possible patents, including but not limited to: 

1 . Patents related to the organization of chemical (and other) information 

2. Patents related to the evolution of the organization of chemical (and other) 
information 

3 . Patents on the search capabilities of the chemical database 

4. Patents related to the organization of precursors and selection (i.e. 
software, sets, etc.) 

5 . Patents on the methods for tying flexible services to these patents 
These and other applications are described in detail in the following sections. 

II. Building a scaleable architecture for synthetic and related databases 
A. Current efforts with static databases 

A book is a static database. It contains information, but has no ability to evolve its 
internal structure once the book is printed. An intermediate level is a software database that 
provides the user with a number of options to choose from and a number of possible answers 
to queries. A more advanced expert system would continuously evolve based on feedback 
loops. The Electronic Combinatorial Index database can evolve into something much 
greater than the initial static product. Procedures for combinatorial chemistry represent a 
subset of all synthesis, procedures for synthesis represent a cross section of procedures for 
drug discovery, procedures for drug discovery are a cross section of all chemistry and 
biology. The details of when and how the connections are made are critical to the growth of 
the producl(s). The architecture will contain procedures (information) and deliverables 
(services) at multiple stages. The internal connections will become stronger as they are used 



and the number of external connections will increase much like the development of an 
embryonic brain. 

B . Our structure for evolving databases: 

The central position will be a set of reliable methods for high-throughput 
combinatorial synthesis. This is analogous to the relationship combinatorial synthesis has to 
all synthetic methods. As previously mentioned, combinatorial synthesis represents the 
most general, expedient and robust synthetic methods because by their very nature they 
should be tolerant of a range of functional groups. While this central position will 
inevitably grow over time as additional scientists publish on combinatorial methods, the real 
growth in the database will be a result of the tentacles that reach into the more mature field 
of chemical synthesis as rather broadly defined. 

The second position will include the most widely used chemistry referenced books 
judiciously selected. The procedures from the leading references will be included along with 
their lineage to the more general high-throughput synthesis methods. Examples of 
appropriate reference works for the second circle include, but are not limited to March's 
Organic Synthesis text, Green and Wuts' Protecting Groups in Organic Synthesis, Bodansky 
and Bodanskys' Practice of Peptide Synthesis, The Encyclopedia of Organic Reagents. All 
of the facts and information from texts will be reformatted in a uniform simple manner for 
easy access by humans and computer programs and to avoid any copyright restrictions. 
Other reference texts that will be added to the second circle, include but are not limited to 
Organic Synthesis, Organic Reaction, Org. Syn Prep., and Comprehensive Heterocyclic 
Chemistry. 

The third circle will include the leading references from the first two circles. When 
appropriate, this will lead to a chain of articles as commonly done when an individual learns 
about a new field while spending time in a chemistry library. 

The fourth circle will actually start to include a systematic reorganization of the 
entire chemical literature in the journals. Reorganizing the information in a common 
format will have obvious advantages for the end user. This is similar to chem abstracts 
except that it will emphasize procedures, tye them to services, and make them part of an 
evolving database. 

Once the inner circles reach a certain mass inertia it will be time to incorporate other 
databases. This is why it is critical to develop very robust search agents at the early stage of 
the project that are able to search other databases and vice-versa. The key to utility will be 
in the organization of the material and the simplicity (with enormous flexibility) of the 
search engine. 

The methods for organization, representation, and uses of databases described herein 
should be applicable to both related and unrelated fields of knowledge. 

C. Representation of information: 

It will be important to create a simple, uniform format with actual procedures and 
reliability ratings that are simple enough for most humans to easily use yet robust enough 
for an artificial intelligence computer program to work with and evolve additional tasks, 
information, and networks 

D. Detailed description of a system that can evolve to include and work with: 

1 . Related fields such as structural biology and drug discovery. 

2. Unrelated fields such as economics and psychology database. 

We are creating a chemo-informatics tool that is as easy and simple to use while still 
being scaleable and tied to services whenever possible. It will be an appropriate real life 
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valuable problem to evolve various competing artificial intelligence technologies and 
approaches. It will be an open system that is compatible with other databases and 
information searching methods. It will include agents to search for information that are 
guided by both an artificial intelligence program and a 24 hour available human expert in 
chemical database mining. 

The chemo-informatics tool includes the most valuable information such as synthetic 
procedures with reliability ratings based on experimental data. All information will be one 
click away from a "shared risk" feasibility study and custom services. The services include 
but are not limited to the delivery of individual compounds, small libraries (bookends of 10- 
100), and large libraries (1000-100,000). With the right infrastructure, systems, and people 
we will routinely deliver custom services faster. 

With our general diversity and targeted libraries we will include first generation 
actual and virtual libraries. The libraries will come with software for SAR trees describing 
follow-on libraries. This will include the option of using other software and biasing the 
SAR trees describing follow-on libraries with known structural information. These will be 
tools that can be used in any laboratory around the world. For example, each set of 2000 
compounds (2D library) and 6000 compounds (3D library) on average will come with an 
option for a second generation library based on the SAR or a full blown-out library (perhaps 
in a split and mix format) if the customer wishes to hide the SAR. Both human and AI 
teams would collaborate and compete on the reaction condition optimization and SAR 
optimization problems. 

The informatics package will be linked to a network of cross references to suggest 
related sets of compounds for synthesis. For example, a section from March's Organic 
Chemistry text might lead to series of papers that suggest a particular set of reaction 
conditions. Both the computer and humans will find patterns of "leading references". The 
key will be, over time, to include reliability ratings. The data output should include 
references, words, and schemes as provided by current chemical databases such as Bielstien 
while including addition outputs such as procedures and the potential to have a compound 
described in the literature made for you and delivered (following shared-risk feasibility). 
The customer would have the option of using our information, using our synthetic services, 
or a combination of the two offerings. 

The organization based services would be initially targeted at the synthetic chemist. 
The value of the services would be proportional to the time, scale, and difficulty. Offerings 
would include but are not limited to "one step from Aldrich, three steps from Lancaster, two 
steps in Bielstien" for example. Reliability ratings will help with assessing difficulty. 

A key component will be to have the structural and reaction infomration stored in a 
uniform manner so both chemists and AI programs get used to improving the network and 
making connections. Each time a human expert finds a new connection (s)he will notify the 
computer program and vice versa. The network technology will evolve on the coattails of 
web technology currently being paid for by Yahoo, Inkotomi, Googgle, and others in 
addition to in-house expertise. 

The information and database services could be extended to include analytical data 
(theoretical and/or experimental), structural data on the molecules, biostructural data for 
more complex problems, chemical ordering information, and web-offerings. These are all 
related fields under the umbrella of drug discovery. 

The information and database service could be extended to partially related fields 
like polymer synthesis and enzymology. A truly robust system could also be extended to 
unrelated fields. This approach could be used, with the judicious selection of starting points 
and experts, to organize medicine in a standard and simple format. The approach could be 
applied to any other field where significant bodies of information our guided by underlying 
logical principles. Obvious examples include psychology, law, engineering, archetecture, 
journalism, economics, history, business, electronics, and the internet to mention some 
possibilities. To do this practically, since we will be late players in these fields, will require 



a strong technology base using the state of the art in both human experts and artificial 
intelligence. 

The guiding principles would be the same as those used in organizing the synthetic 
information. Namely, identify the most valuable content (such as synthetic procedures) and 
develop a uniform method of representation. For example, instead of randomly surfing the 
Internet, use peer reviewed content from scientific journals and books. We will carefully 
select the content to develop intuitive products. Finally, we will continue to offer services 
either by building in-house expertise or partnering if and when appropriate. 

E. Examples of evolved utility with open systems 

There are a number of offerings that have demonstrated the utility of open systems that leverage 
the efforts of others. In chemistry, Sciquest, Chemdex, Cambridge Soft's web portals are more recent 
examples. Historically, the Aldrich chemical catalog is a traditional example whereby chemicals are 
bought elsewhere and rebottled or made in house with the same result for the customer. To the extent 
possible, the Libraria flexible service offering will take into account a range of requests and then search 
the world's database of synthetic information for appropriate solutions. 

III. Detailed description for tying chemical services to the information databases 

Because we will be reorganizing the chemical information as it is added to the database, although 
it will be an ambitious task but we will be able to pick and choose what to include and to assign reliability 
ratings (either based on theoretical or actually experimental data). Some of this information will be 
obtained from third party laboratories. 

Related to the concept of creating a uniquely consistent format for organizing chemical 
information is the ability to develop a system to tie a flexible range of services that are not currently 
possible to this information. As one example, instead of having a follow-on library with just the same 
reactions (which is the most straightforward approach both scientifically and logistically) have a software 
program(s) and/or human(s) select from a range of chemistries with reliability ratings either as diverse or 
targeted sets of library products and services. 

Another way to tie information to services is to organize flexible sets of precursors based on 
structural criteria in the literature (see, Lipinski, C. et al and Murcko, M. et al). This will be organized in 
a uniform, yet uniquely flexible organization of the data input variables for library generation (both for 
individual libraries or groups of libraries as described in the first example). The precursors can be 
organized and barcoded in rows and columns of a grid such as the industry standard 96-well microtitor 
plate. Additional information could be added to the precursor sets piror to internal or external use 
including but not limited to solubility data, reliability ratings '(+/- or 1-10), aromatic/aliphatic, 
diverse/targeted, hydrophobic/hydrophilic, alpruVbeta substituted, o,m,p-substituted, ring size, 
bicyclic/fused, etc. Customers would have the option of using the software tools or their own in the 
selection process. They could test the chemistry in their own laboratory (one place) with a subset of 
precursors and then generate the full library in our laboratory (second place) because of the careful 
preorganiation and preselection of appropriate data input precursors. These preorganization of these sets 
of precursors would facilitate "shared risk" rapid feasibility studies on precursor compatibility with new 
chemistries. Finally it would facilitate library design, feasibility, and generation functions. 

The preorganization of both synthetic information and structurally relevant precursor sets will 
allow for more flexible chemical services. These could range from individual compound synthesis, to 
small sets (bookends), to large sets (full libraries). The key will be to build up a critical mass of 
intelligently organized, flexible chemical information (both in computers and people) to offer the greatest 
range of services. The guiding principles could be applied to other related and unrelated fields. 

I. Network approach with open systems to develop: 

A. New applicable chemistries 

B . New Al applicable technologies 
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C. Unique groups of experts to interact with the evolving database of information 
and to suggest improvements/additions after each use (perhaps for a discounted 
price or similar benefit, etc.). 

To truly realize the full potential of this approach would require a substantial effort to create 
industry standards with open systems and then to find ways to work with other experts in their various 
fields. For chemistries this would include experts from acadamia as well as combinatorial chemistry 
laboratories. A similar network could help evolve the artificial intelligence applications and technologies 
once a robust framework for programming is created. Finally, the extent to which the system tracks use 
and has feedback loops based on standard representations of data is the extent to which the system can 
evolve. 

List of possible alliances: 

For combichem: 

Axys 

DPI 

Trega 

ArQule 

Cambridge Combinatorial 

Molecumetics 

NCE 

3D Pharmaceuticals 

Chemcodes 

UC Berkeley 

Harvard 

Stanford 

For webstrategy: 
ChemNavigator 
ChemRoutes 

For software development: 

Scivision 

Tripos 

Stanford 

U of T.Austin 

For web logistics: 
Sciquest 
Chemdex 
Cambridge Soft 

For building blocks: 
Aldrich 
Argonaut 
Lancaster 

For instrumentation: 
Robbins/Genevac 
Hewlett-Packard 
Berger Instruments 
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Big Pharma (especially Merck) 



Contact: 

Dr. Barry A. Bunin 
Tel 650-873-7704 
Fax 650-873-7707 
bunin@combinatorial.com 
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